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Introduction: 


The  overall  goals  of  this  proposal  are  to  develop  a  rapid  PCR  assay  to 
detect  the  genome  wide  Long  Interspersed  Sequence  (LIHs)  mediated  instability 
and  to  isolate  target  oncogenes,  tumor  suppressor  genes  and  regions  of 
chromosomal  instability  by  the  use  of  LI  Hs  specific  PCR.  In  order  to  accomplish 
these  goals  we  modified  our  approach  and  took  advantage  of  the  recent  progress 
in  the  human  genome  intiative  and  cancer  genome  anatomy  projects  (CGAP). 

DNA  sequence  analysis  of  a  novel  LI  Hs  element  that  we  characterized  in 
the  laboratory  revealed  a  segment  of  homology  to  MER22  repetitive  elements. 
These  findings  coupled  with  the  rapid  progress  in  the  human  genome  initiative 
led  us  to  redesign  our  approach  to  detect  target  genes.  Thus  we  screened  the 
cDNA  database  dBEST  for  novel  genes  which  contain  repetitive  elements  in 
them.  About  the  same  time,  other  studies  in  our  laboratory  identified  a  partial 
cDNA  clone  for  a  key  regulatory  homeobox  MIX  like  gene  in  the  dBEST 
database.  Interestingly,  the  3’  UTR  of  this  gene  contains  MER  repetitive 
elements.  Furthermore  the  expression  of  this  novel  gene  appeared  to  be  highly 
restricted  with  the  identification  only  3  cDNA  clones  in  the  entire  dBEST 
database  with  more  than  3  million  entries.  Our  interest  in  this  regulatory  gene 
was  further  heightened  as  one  of  the  3  cDNA  clones  was  from  a  laser  captured 
tumor  tissue  from  infiltrating  mammary  adenocarcinoma  in  situ.  All  of  the 
subsequent  studies  detailed  in  this  report  stemmed  from  these  key  observations 
made  in  early  stages  of  the  first  year  of  this  award. 
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Year  1  of  the  proposal  was  to  focus  on  specific  Aim  1 .  The  following  report  summarizes 
what  we  have  accomplished  towards  specific  tasks  in  the  approved  statement  of  work. 

Specific  Aim  I  (i-iii) 

(i)  Development  of  primers  and  conditions  for  IPCR  and  LI  Hs  PCR 

(ii)  Analysis  of  hybridization  patterns 

(iii)  Subcloning  of  unique  fragments 

In  accomplishing  the  tasks  i-iii  of  Specific  Aim  I,  we  have  taken  advantage  of  the 
rapid  progress  made  in  the  human  genome  sequencing  initiative,  the  cancer  genome 
anatomy  project  and  more  importantly  characterization  of  a  novel  LI  Hs  element  in  our 
laboratory.  The  novel  element  designated  LREHL  showed  segments  of  homology  to 
MER  repetitive  elements. 

We  initiated  a  screen  of  the  expressed  sequence  tagged  database  (dBEST) 
electronically  for  transcripts  with  repetitive  sequences.  About  the  same  time,  other 
investigations  in  our  laboratory  were  searching  for  the  human  homolog  of  the  homeobox 
gene  Mix.1,  which  regulates  mesoderm  and  endoderm  differentiation  in  Xenopus 
embryos.  A  single  candidate  EST  clone  (AA84781)  with  a  putative  Mix  like 
homeodomain  was  identified  in  a  library  from  germinal  center  B  lymphoid  cells.  When 
the  EST  clone  AA84781  was  sequenced,  we  identified  a  partial  open  reading  frame.  A 
recursive  search  of  the  dBEST  database  identified  only  two  other  cDNA  clones  for  this 
novel  gene,  suggesting  an  extremely  restricted  pattern  of  expression.  Interestingly,  one 
of  these  ESTs  (AA911377)  was  from  a  cDNA  library  made  from  laser  captured 
infiltrating  ductal  carcinoma  of  the  breast  (Fig.1).  The  significance  of  this  finding  was 
further  enhanced  when  we  detected  alu  and  MER  like  repetitive  elements  in  the  3’  (> 
I.Okbp)  UTR  of  this  gene. 

(iv)  Sequence  analysis  and  development  of  STSs 

We  developed  unique  STSs  from  the  3’untranslated  region  of  the  MIX  like 
homeodomain  gene.  Using  these  primers  we  screened  a  human  genomic  library  in 
Bacterial  Artificial  Chromosome  vector  to  isolate  a  large  segment  of  the  genome 
containing  the  gene.  The  BAC  was  used  to  characterize  the  full  length  open  reading 
frame  of  this  novel  homeogene  that  we  designated  MIXL2.  The  gene  encodes  an  open 
reading  frame  of  232amino  acids.  Similar  genes  whose  expression  is  restricted  to  the 
primitive  streak  have  been  identified  in  mice  and  chicken  (1-3).  MIXL2  shows  94% 
identity  in  mix-like  domain  with  an  overall  identity  of  69%  to  the  murine  gene  MML 
(Fig. 2).  With  the  chicken  gene  CMIX,  we  find  a  79%  identity  in  the  homeodomain  and 
an  overall  identity  of  41%.  Identification  of  the  three  distinct  and  yet  highly  related 
genes  suggests  the  existence  of  a  novel  mix-like  homeobox  gene  family. 

Additionally,  we  characterized  the  genomic  organization  and  expression  pattern 
of  MIXL2.  The  gene  consists  of  two  exons  (Fig.  3)  and  shows  a  highly  restricted 
expression  pattern.  None  of  the  breast  cancer  cell  lines  tested  showed  abundant 
transcripts.  However,  reverse  transcription  coupled  PCR  detected  MIXL2  transcripts  in 
the  cell  line  MDA-MB  453  and  not  in  the  immortalized  breast  cell  line  MCF10A. 
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Simultaneously,  we  fine  mapped  the  novel  gene  to  human  genome  using  DNA 
from  a  panel  of  well-characterized  radiation  hybrids.  Interestingly,  the  MIX  like  gene 
localized  to  human  chromosome  1q32-41  locus,  between  the  microsatellite  loci  D1S479 
and,  a  region  of  gain  in  breast  cancer  (Fig.4). 

(v)  Development  of  the  PCR  for  minimal  material 

In  collaboration  with  Dr.  Nour  Sneige  in  the  Dept,  of  Pathology,  we  dissected  25 
breast  tumors  and  isolated  normal  and  tumor  DNAs.  The  status  of  D1S479  was 
examined  in  these  paired  samples  by  polymerase  chain  reaction  (PCR)  with  end 
labeled  primers.  Interestingly,  12  out  of  25  tumors  showed  allelic  imbalance  for  D1S479 
suggesting  that  the  MIXL  gene  was  a  candidate  target  gene  of  chromosomal  gain  in  1q 
(Fig.  5). 

(vi)  Correlation  of  data  from  tasks  (i-v)  with  stage  and  grade 

All  of  the  25  breast  tumors  used  to  screen  for  allelic  imbalance  of  D1S479 
showed  lymph  node  involvement.  Studies  are  underway  to  determine  whether  there  is 
intra-tumoral  heterogeneity  within  for  gain  of  1q32-41  loci.  We  will  utilize  both  FISH  and 
MIXL  specific  antibody  based  immunofluorescence  staining  approaches  to  examine 
such  a  possibility. 
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Key  Research  Accomplishments: 


•  Identification  of  a  novel  regulatory  homeobox  gene  with  repetitive  elements  in 
the  3’UTR 


•  Isolation  and  characterization  of  the  full  length  open  reading  frame  for  the 
novel  gene  designated  MIXL2 


•  Localization  of  MIXL2  to  1q41 ,  a  region  of  gain  in  breast  cancer 


•  Detection  of  allelic  imbalance  for  D1S479,  a  marker  physically  linked  to 
MIXL2,  in  12/25  breast  tumors  with  lymph  node  involvement 
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Reportable  outcomes: 

Manuscripts  in  preparation: 

1.  Hejlik  DP  and  Nagarajan  L.  Retrotransposition  and  recombination  of  LINE-1 
Elements  as  a  mechanism  of  chromosomal  deletion  and  genomic  instability 


2.  Guo  W.,  Ma,  J.,  Liang  H.,  and  Nagarajan  L.  A  novel  MIX  like  paired  type 
homeobox  gene  localizes  to  a  region  of  in  breast  cancer 


3.  Guo  W.,  Chan  A.,  Ma  J.,  Liang  H.,  Etkin  L  and  Nagarajan  L.  MIXL2  shows 
evolutionary  and  functional  conservation:  a  role  in  hematopoiesis 


Development  of  FISH  and  immunohistochemical  staining  probes: 

We  have  characterized  a  BAC  probe  that  can  be  used  in  Fluorescence  in  situ 
hybridization  (FISH). 

We  have  developed  antibodies  to  MIXL2,  which  can  be  used  in  immuno¬ 
histochemical  staining. 


Informatics: 

The  cDNA  sequence  for  MIXL2  has  been  submitted  to  human  genome 
organization.  The  full  length  sequence  will  be  submitted  to  Genbank. 


Funding  applied  for  based  on  work  supported  by  this  award: 

Application  by  Wei  Guo  entitled  “Functional  characterization  of  a  novel  embryonic 
homeobox  gene  MIXL2  in  breast  cancer"  for  Graduate  training  fellowship,  from 
the  Breast  Cancer  Research  Program,  Department  of  Defense  Submitted  on 
June  7,  2000. 
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Conclusions: 


A  genome  wide-electronic  screening  approach  yielded  us  clues  on  an  important 
regulatory  homeobox  gene  with  repetitive  elements  in  the  3’  UTR. 

The  MIXL2  gene  is  highly  restricted  in  its  expression.  There  are  only  3  cDNA 
clones  for  this  gene  in  the  entire  human  cDNA  expressed  sequence  tag  (EST) 
database  that  contains  more  than  3  million  entries.  Interestingly,  one  of  the  three 
ESTs  for  MIXL2  is  from  a  laser  captured  infiltrating  ductal  carcinoma  in  situ  of  the 
breast. 

The  MIXL2  gene  consists  of  two  exons  and  encodes  an  open  reading  frame  of 
232  amino  acids. 

We  localized  it  to  a  region  of  chromosomal  gain  (1  q41 )  in  breast  cancer. 

Interestingly  D1S479,  a  marker  that  is  physically  linked  to  MIXL2  is  gained  in  12 
out  of  25  breast  tumors  with  lymph  node  involvement. 

Homeobox  genes  encode  a  family  of  regulatory  transcription  factors  that  primarily 
play  a  crucial  role  in  development.  Other  evidences  suggest  that  certain 
homeobox  genes  provide  an  important  link  between  the  processes  of  cell  cycle 
control,  development  and  cancer. 

Future  experiments  will  identify  the  functional  consequence  of  gain  of  MIXL2 
gene  in  breast  cancer  as  well  as  characterization  of  other  novel  regulatory  genes 
with  LI  repetitive  elements. 


References: 


1.  Pearce  JJ,  Evans  MJ.  Mm!,  a  mouse  Mix-like  gene  expressed  in  the 
primitive  streak.  Mech.  Dev.  87:189-192,1999. 

2.  Stein  S,  Roeser  T,  Kessel  M.  CMIX,  a  paired-type  homeobox  gene 
expressed  before  and  during  formation  of  the  avian  primitive  streak.  Mech. 
Dev.  75:163-165,1998. 

3.  Peale  FV  Jr,  Sugden  L,  Bothwell  M.  Characterization  of  CMIX,  a  chicken 
homeobox  gene  related  to  the  Xenopus  gene  mix.1.  Mech.  Dev.  75: 
167-170,1998. 
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EST  clone  shows  a  mix-like  homeodomain.  The  complete  sequencing  of  the  EST  clone  allows  us  to  retrieve  the  other  two 
EST  clones  from  different  cDNA  libraries.  The  detection  of  only  three  EST  clones  suggests  a  highly  restricted  expression 
of  MIXL2. 


Homology  between  human  MIXL2  and  mouse  MML 
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Figure  2.  Homology  between  human  MIXL2  and  mouse  MML.  MML  is  a  mix-like  gene  in  mice.  MIXL2  shows  94%  identity 
on  the  mix-like  domain  and  overall  identity  of  69%  with  mouse  MML.  The  carboxy  terminal  acidic  motif  is  also  conserved. 
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Figure  4.  Genomic  localization  of  MIXL2.MIXL2  is  localized  to  human  chromosome  lq32-41  by  radiation  mapping  and  FISH 
(data  not  shown).  PCR  confirms  that  MIXL2  can  hit  YAC750G7  in  the  contig  Whitehead  Institute  Contig  WC-516  of  Chrlq.  The 
BAC216I1 1  was  retrieved  from  human  BAC  library  (Research  Genetics,  Inc)  by  PCR  screening.  One  end  of  BAC216I1 1  also  hit 
YAC750G7 
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